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Abstract 

We consider the coordinated downlink beamforming problem in a cellular network with the base stations (BSs) equipped 
with multiple antennas, and with each user equipped with a single antenna. The BSs cooperate in sharing their local 
interference information, and they aim at maximizing the sum rate of the users in the network. A set of new lower bounds 
(one bound for each BS) of the non-convex sum rate is identified. These bounds facilitate the development of a set of 
algorithms that allow the BSs to update their beams by optimizing their respective lower bounds. We show that when 

O , 

there is a single user per-BS, the lower bound maximization problem can be solved exactly with rank-1 solutions. In this 
case, the overall sum rate maximization problem can be solved to a KKT point. Numerical results show that the proposed 
' algorithms achieve high system throughput with reduced backhaul information exchange among the BSs. 

I. Introduction 



Multiple Input - Multiple Output (MIMO) communications (T) have been adopted in many recent wireless standards, 
such as IEEE 802.16 (2) and 3GPP LTE [3 |, in the aim of boosting the data rates provided to the customers. A promising 
solution to achieve spectrally-efficient communications is the universal frequency reuse (UFR) scheme, in which all cells 
operate on the same frequency channel. However, the downlink capacity of the conventional cellular systems with UFR 
is limited by inter-cell interference. As a result, it is necessary to introduce coordination among the base stations (BSs) 



> 

m 

so that they can jointly manage the tnterferences in all cells to improve the system performance ffl. Such coordination 
technique among the BSs in the downlink is also known as network MIMO H) or Coordinated multipoint (CoMP) |6). 
Some other approaches in the literature have exploited less complex linear schemes, such as Block Diagonalization (BD) 
Q or MMSE [8]. The main drawback of all these systems is that they require channel state information (CSI) and 
transmit data simultaneously known to all cooperating BSs, with the cost of increased signal overhead. Some recent 
approaches have been proposed to avoid CSI and data sharing. Non-coherent joint processing |9) does not require cell- 
to-cell CSI exchange at the expense of higher processing cost at the receivers with successive interference cancelation. 
In ifTOl . the authors analyze the case of distributed cooperation where each BS has only local CSI. 

In this correspondence we consider a cellular scenario with an arbitrary number of multiantenna transmitters (the BSs) 
and single-antenna receivers (the users). We focus on an intermediate approach where the BSs optimize the downlink 
throughput with only the CSI information. Since channel variations are much slower than that of data, the amount and 
the frequency of information exchange is greatly reduced. 

Unfortunately, the sum rate maximization problem is non-convex and thus is difficult to solve efficiently. The authors of 
ifm propose to solve the single cell downlink rate maximization problem first (with dirty paper coding (DPC) and zero- 
forcing (ZF) precoding), and then impose interference limit to the users on the cell edges. In this case, the interference 
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limits to the users are set in a rather heuristic fashion, and the BSs are not coordinating their beamforming. References 
|[T2l and lfl3l are two recent works that propose heuristic algorithms that try to provide solutions to similar problems by 
directly solving the non-convex optimization problem. 

In this correspondence we provide theoretical insights to the coordinated downlink beamforming problem by identifying 
a set of lower bounds (one bound per BS) of the non-convex system sum rate. The benefits of such per-BS lower bounds 
are twofolds: 1) the individual BSs can distributedly optimize their respective lower bounds instead of jointly optimizing 
the original system sum rate to approach a solution to the sum rate maximization problem; 2) individual BSs can monitor 
the improvement of the total sum rate by evaluating their respective lower bounds. Utilizing this set of lower bounds, we 
propose algorithms for the BSs to coordinately optimize their beams. In a special case where each cell has a single user, 
each lower bound becomes concave, and we show that the lower bound maximization problem can be solved exactly. This 
result allows us to obtain a stationary solution of the original sum rate maximization problem. In the general case with 
multiple users per cell, we propose an algorithm that extend the Iterative Coordinated Beamforming (ICBF) algorithm 
proposed in fl3\ , with important difference that the BSs act sequentially instead of simultaneously, and there is no "inner 
iteration" needed. The simulation results show that the proposed algorithms have similar sum rate performance as the 
ICBF algorithm, while requiring significantly less information exchange among the BSs in the backhaul network. 

The correspondence is organized as follows. In section [n] we give the system description, and provide a general lower 
bound for each user. In section ITTT1 and HVl we propose algorithms for the BSs to compute their beamformers in different 
network configurations. In section [V] we provide numerical results to demonstrate the performance of the proposed 
algorithms. This correspondence concludes in Section [VT] 

Notations: For a symmetric matrix X, X y signifies that X is positive semi-definite. We use Tr(X), |X|, X H , X^ 
and Rank(X) to denote the trace, the determinant, the hermitian, the pseudoinverse, and the rank of a matrix, respectively. 
\X]i,i denote the (i, i)th element of the matrix X. I n is used to denote a n x n identity matrix. We use [y, x_»] to denote 
a vector x with its i th element replaced by y. We use R JVxM and <C NxM to denote the set of real and complex N x M 
matrices; We use and to denote the set of N x N hermitian and hermitian semi-definite matrices, respectively. 
Define M t = {(M + 1) mod t} + 1 as an integer taking values from 1, ■ ■ • , M. 

II. Problem Formulation and System Model 

We consider a multi-cell cellular network with a set M. = {1, • • • , M} of base stations (BSs)/cells; each BS is equipped 
with K m transmit antennas; each cell m has a set Af m of distinctive users; let TV denote the set of all users, and each 
user is equipped with a single receive antenna. We use (m, i) and — (m, i) to denote the ith user in mth cell and all the 
users except user (m,i), respectively. Without loss of generality, we assume that all the cells have the same number of 
users, and all the BSs are equipped with the same number of antennas: \Af m \ = N, K m = K. V m € M.. The signal 
x m G C transmitted by BS m is x m = YliieN w m,i^m,j> where b m ,i is the complex information symbol sent by BS 
m to user i € Af m , using beam vector w m .; G C A . Assume E[\b m: i\ 2 ] = 1, for all (m, i) and E[b mi ib*j] = 0, for all 
(m, i) 7^ (q,j)- Assume that each BS m G Ai has a total transmission power constraint: J^ieM ll w m,i|| 2 < Pm- Let 
hg jmi G C K denote the complex channel between the qth BS and the ith user in mth cell. Let n m i G C denote the 
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circularly-symmetric Gaussian noise with variance c m ^. The signal received by a user (m,i) can be expressed as 

(1) 



Intra-cell Interference Inter-cell Interference 

The rate achievable for user (m,i) is given by 



U m ,,(w m ,,w_ M )=log 1+- — -^j— — (2) 

/ h H W h \ 
= ]ogll + — — = I =Jlm,i(W m , <J W_ (m , <) ) (3) 

where W m ,j = w^iW^ is the transmission covariance of user (m,i), and H m . mi = h m . mi h^ m . is the channel 
matrix. Clearly, W TOj j >; and Rank(W m .j) = 1. Define the total interference plus noise at user (m, i) as 

= Cm.j + Wm,jHm,m» W m ,j + w "jH 9 ,m;W 9 ,j = 7 m ,;( w -(m,i))- ( 4 ) 

We assume that / m .j(W_( m ^) is perfectly known at the user (m, i) and the BSs m, but not the neighboring BSs. As 
suggested by Q, this interference plus noise term can be estimated at each mobile user by various methods, and fed 
back to its associated BS. Define the collection of matrices W m = {W m> i}i£Af m , W_ m = {W q ,j}jeN „,q-£m, an d W = 
{~W m }meM< men me sum rate °f a U users in cell m can be expressed as: R m (W m , W_ m ) = S»eAf ^m,i(W TOi i, W_( mi j)). 
The sum rate of all users in the network, is R(W) = ^2 qeM i?g(W g , W_ g ). We are interested in the following non- 
concave sum rate maximization problemj: 

max R(W) (SRM) 
w 



s.t. Tr 



<Pm, v m e M 



W m ,i y 0, Rank(W m ,i) < 1, V (m,i). 

We mention that all the following discussions are equally applicable to the problem of weighted sum rate optimization, 
in which there is a set of non-negative weights associated to the users' rates in the objective. However, we mainly 
consider the (SRM) problem for simplicity of presentation. 

In order to approach the problem (SRM), we first establish some useful results that characterize the users' rate (01. 

Proposition 1: For all (q,j) ^ (m,i), flm,j(W mi i, W_( mi j)) is a convex function ofW q j on and a concave 
function ofW m ,i on S+. 

Proof: In order to show the convexity result, it is sufficient to prove that whenever D G E> K , D/0 and W g j +tD y 
0, the following function is convex in t lil4l Chapter 3] 



Rm,i(t) = log 1 H -= j-3 — — — — r-g — p== rprrr- I • (5) 

y C m ,i + Z^(p,l)^(q,j),(p,l)^(m,i) "p.m; VV pjllp.m, + n ?>m . W q> j + tU)Il, im ^ 

'This problem can also be expressed in an equivalent vector form, with {w m } m6j vt as design variables. 
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Let us simplify the expression a bit by defining the constant c = m W m .ih m mi > (note that ~W m ,i h 0). The 
first and the second derivatives of R rn ,i{t) w.r.t. t can be expressed as 

dRm,i(t) _ l/ln(2) chf mi Dh 9 , mi 



dt {Im.i( w -( m .i)) + th» mi 'Dh q . rni + c) (l mii (W_( m ,i)) + <h» m .Dh s>mi ) 

d 2 R m At) _ l/ln(2) c(h^ m ,Dh, 



(6) 



) 2 



dt 2 (/ m ,i(W_ (m , i) ) + thf m .Dh a (W_( mi i)) + th* m . Dh ?imj 

l/ln(2) c(h^ m ,Dh,, m ,) 2 

/m,i(W_ (m , i) ) + th|f m .Dh,, mi + c (im,i(W_( mii )) + th» mi Dh,, mi ) 2 ' 

Clearly I m ,i{"W-( m ,i)) + ih^ mj Dh g ,„ ls > for all W 9J + iD >; 0. We also have that h^.Dhq,^ is real and 

(h^ m .Dh 9 . mi ) 2 > 0, due to the assumption that D 6 § A , and the subsequent implication that (h^ m .Dh g lTli ) H = 

h^ m .Dh 9jjrii . We conclude that whenever D £ S K andW f/ j+tD >z 0, d R ^ t i^ > 0, which implies that R m ,i(W m ,i,'^-(m,,i)) 

is convex in W g j for all (q,j) ^ (m,i). 

The fact that i? m ,;(W m ,i, W_( m j)) is concave in W mj i can be shown similarly as above. ■ 
Note that the above property is only true in the space of co variance matrix W m , but not in the transmit beamformer 
space w m . This convex-concave property of the individual users' transmission rate is instrumental in deriving a set of 
lower bounds for the system sum rate. For a particular user (m, i), the system sum rate i?(W) can be expressed as 

R(W) = JU,i(W m ,i,W_ (m ,i))+ Yl ^.j(Wm,i,W_ (mii) ). (8) 



concave in W„ 



Defined R-( m ,i) (W) = Y^<qj)M m i) ^i>3 (W)- We can ^ n< ^ a l° wer bound for i?(W) by linearizing the R_( m ,i) (W) 
with respect to W m ,j around a fixed W. Utilizing the fact that R-( m .i) (W) is convex in W TOl ,, we obtain 

£ i^j (W mii , W_ (mii) ) > R- {m>i) (w) - ^ Tr [t„j (w) H m , 9j (W m ,< - W m ,<)] (9) 

with T 9 , (W) 4 ^ „ W -"- > 0. (10) 

Let us define a concave function of W m .j 

C/m,i(W m ,«,W_ (mii) ) 4 ^(Wb^W-m) + i?_ (mil) (w) - ^ Tr [t sJ (w) H m , 9j (W m ,» - W m ,i)] . 

Then from (0, © and the definition of U m: i( ), we must have 

[/m,i(W B ,i,W- (n , i) )a(W n , i ,W- M ), VW m , l > : (11) 

where the equality is achieved when W m = W m j. We refer to this lower bound as the "per-user" lower bound, 
as it is defined w.r.t. each user (m, i). Such lower bound is useful, because if we can find a W* n , that satisfies 
Cm.ifW^, W_( m>i )) > fm,»(W m ,i, W_( m)i j), then the system sum rate must increase, as 

R(W* m ,i,W-( m ,i)) > U m ,i(W* m>i ,W_ (m>i) ) > C/ m ,i(W m , i ,W_ (m , li) ) = fl(W mii , W_ (mii) ). (12) 



III. Multi-cell Network with Single User In Each Cell 

We first consider an important scenario in which each BS transmits to a single user. This scenario may arise in a 
heterogeneous network when each BS transmits to a relay in its cell. As there is a single user in each cell, we simplify 
the notation by using U m (.), T q (.), I m {-) instead of U m .i(.), T q ^(.) and I m ,i(-), respectively. We use W,„ to denote 
the covariance of BS m to its user; we use H m g to denote the channel between BS m to the user in the cell of BS 
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q. Notice that the per-user bound identified in Section HI1 becomes per-BS bound, as each BS has a single user in this 
scenario. For simplicity, define J2qjt m Tq (^J ^m,! = b 0, then the per-BS bound can be expressed as: 

t/ m (W m ,W_ m ) ^Rm(W m ,W- m )+R-m (w) -Tr[A m (W m -W m )] . (13) 

Define the feasible set for BS m as J 7 ,,, = {W TO : Tr [W m ] < p m , W m >; 0, Rank(W TO ) < 1}. The idea is to let 
the BSs take turns to optimize their respective lower bounds {U m (.)}. Assuming other BSs' transmissions are fixed as 
W_ m , the Lower Bound Maximization problem (LBM) for BS m is 

max f/ m (W ro ,W- m ). (LBM) 
w m e.F m 

Notice that after relaxing the rank constraint, the problem (LBM) is a concave problem in the variable W TO . In the 
sequel, we will refer to the problem (LBM) without the rank constraint as (R-LBM), and define its feasible set as 
= { W ™ : Tr [W m ] < p m , W rn h 0}. 
The problem (R-LBM) is a concave determinant maximization (MAXDET) problem lfT51 . and can be solved efficiently 
using convex program/SDP solvers such as CVX [16]. However, in practice such general purpose solver may still induce 
heavy computational burden. Moreover, the resulting optimal solution of the relaxed problem may have rank greater 
than one. Fortunately, these difficulties can be resolved. We have found an explicit construction that generates a rank- 
1 solution of the problem (R-LBM) (hence the optimal solution of problem (LBM)). The rank reduction problem of 
downlink beamforming has been recently studied in IfTTI , |[T8l and IfTTI . However the algorithms proposed in those works 
cannot be directly used to obtain a rank-1 solution to (LBM): reference IfTTI considers problems with linear objective 

functions; references IfTTI and lfl"8l consider the relaxation of the MAXDET problem without the linear penalty terms 0. 

Removing all the terms in the objective of (R-LBM) that are not related to W m , we can write the partial Lagrangian 
of the problem (R-LBM) as 



L(W m , P-m) = log 



— - Tr[(A m + At„I)W ro ] + A*mPm (14) 

Jm(W-m) 



where \i m > is the Lagrangian multiplier associated with the power constraint. Notice the fact that A r „ >: 0, then for 
any \i m > 0, we can perform the Cholesky decomposition A m + [i m I = ~L H L, which results in Tr[(A m + /j„,I)W,„] = 
Tr[LW m L H ]. Define W m (p m ) = LW m L H , we have 

1 



L(W m ,fi m ) = log 



I + L- x W m ( Mm )L- 



im(W- 



Tr[W m (/i m )] + UmPr, 



( = 5 log |l + W„ i ( Mm )VAV Jf | - Tr[W m (^ m )] + fi m p m 
( = } log |l + W ro (/x m )A| - Tr[VW m (^„,)V H ] + Mm p m 

= log jl + W m (/i m )A - Tl'[W m (/i m )] + fj, m pm = L (W OT (flm ) ) (15) 

where in (a) we have used the eigendecomposition: L _ff H m m L _1 =i = VAV fl ; in (b) we have defined 

W m (/i m ) = V H W m (n m )V. Let W^ium) denote an optimal solution to the problem max^,, yQ i(W m (/i m )). 

We claim that there must exist a W^(M m ) that is diagonal. Note that Rank(H m m ) = 1 implies Rank(A) < 1. Thus 
W* n (/i m )A has at most a single column. This implies that we can remove the off diagonal elements of I + WJ n (/z m )A 



without changing the values of 



I + W^ m )A 



Consequently, for any given W* Tl (/x m ), we can construct a diagonal 



optimal solution W^ £) (/i m ) by removing all its off diagonal elements. This operation removes all the off diagonal 

2 With lineal' penalty in the form of — Tr |^A m (W m — W m>q )J , equation (43) is no longer equivalent to equation (44) in 1181 . 
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I + W* Tl {fjL m ) A or Tr[W,^ (/i m )] . Consequently W^ D (fi m ) 



elements of 1 + W* l (/^ r „)A, and it does not change either 
is also optimal. When restricting W* n (/!,„) to be diagonal, we can find its closed-form expression 



,if[A] M /0; [W,* n ( Mn 



0, otherwise, 



(16) 



where [x] + = max{0, x}. Then we can obtain W* n (^ m ) = L~ 1 VW* n ([i m )V H ~L~ H . Combining the fact that 
Rank(A) < 1 with ( TToT l we conclude Rank(W*(/i TO )) < 1, and consequently Rank(W,^ t (/i„ l )) < I, for any \i m > 0. 

It is relatively straightforward to show that TrfW*^^)] is strictly decreasing with respect to fj, m . Consequently if 
the optimal multiplier [i* m > 0, then a bisection method can be used to find [i* m that satisfies the feasibility conditions 
Tr[W^(^)] < p m . Furthermore, we can also show that when /.**„ = 0, A m must have full rank. In this case, we can 
find the Cholesky decomposition A,„ = LL H , and the above construction can still be used to directly obtain W^(0) 
(without bisection), that satisfy Rank(W* Tl (0)) < 1. 

In conclusion, for any [i* m > 0, we obtain RankCW*^^)) < 1. Table U summarizes the above procedure. 

TABLE I 
The Optimization of (LBM) 



51) Choose /LiJJ, and fj} m such that /i* n lies in [n l m , /iJJ,]. 

52) Let /C ld = 04 
L^L = A™ + 



+ fi^ n )/2. Compute decomposition: 

midj 



VAV H = L" 



'H 



-1 



1 



r m (W_ m ) 

53) Compute W^(^™ w ) by {16}. 

54) Compute W^(tt™ W ) = L : VW 



0iS id )V H L- H . 
li™ d \ otherwise let ^t" 



55) If TrCW^GC 4 d )) > Pm, let /4n 

56) If |Tr(W^(M^ <d )) - Pm| < e or |/x£ - Mml < stop; otherwise go to S2). 



In the following, we identify a special structure of the problem (R-LBM) that allows it to admit a rank-1 solution. 
To this end, we tailor the rank reduction procedure (abbreviated as RRP) proposed in [17] to fit our problem 0. Assume 
that using standard optimization package we obtain an optimal solution W* n to the convex problem (R-LBM), with 
Rank(W*„) = r > 1. Let Wm = W*^ and let = r. At iteration t of the the RRP, we perform a eigen 
decomposition W$ = V^VW*, where g c^ xr(t> . If R® > 1, find G S r( " such that the following three 
conditions are satisfied 

Tr ( D W V (*) H H m . m V(*') =0 (17) 

Tr(D( f »V(^A m VW) = (18) 

Tr(D (i) V w V») = 0. (19) 

If such D'*) cannot be found, exit. Otherwise, let A(D(*)) be the eigenvalue of with the largest absolute value, 
and construct W r ( * +1) = V<*)(I r - A(D 1 W) D^)V( t ) g y 0. Clearly, Rank(I r - A(p 1 (t)) ) < j-W - 1, as a result, 

3 Note that the RRP procedure in 1171 cannot be directly applied to our problem. This is because in 1171 . the RRP is used to identify rank-1 solution 
of semidefinite programs with linear objective and constraints. Our problem is different in that the objective function is of a logdet form. 
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Rank(Wm ) < Rank(Wm ) — 1> i-e-, the rank has been reduced by at least one. Utilizing ([T7ii-([T9il, we obtain 



h H W (t+1) b — TrPFT W*' +1) l — Tr H V (t) (T - Tl^'IV 

u m,m vv m "m,m — H[n m , m VV m J — II n m , m V \(Y)(t)^ 

— TWH w'*'l — h H w'''h H 

— 11 L J - J -m,m » V m J — 11 m,?n vv m 1A m,m 



(20) 



Tr[A m W^ +1) ] = Tr A m V (t) (I r 



1 




H 



Tr[A m w£?] 



(21) 



A(DW) 



Tr[W^ +1) ] =Tr V (t) (I r 



1 




ii 



Tr[W^]. 



A(D(*)) 



(22) 



Equation d20b and ( f2TT > ensure that the objective value of (R-LBM) does not change, i.e., t/ m (Wm +1 ',W_ m ) 



U m (W$, W- m ). Equation ([22j ensures Tr[W^ +1) ] = Tr[W$] < p m . Combined with the fact that W^ +1) >r 0, we 



Evidently, performing the above procedure for at most r times, we will obtain a rank-1 solution W* n that solves the 
problem (LBM). Now the question is that under what condition can we find that satisfies (TT7T> — (fT9b in each iteration 
t. Note that D' 1 ' is a r^> x r^> Hermitian matrix, hence finding that satisfies dT7b — (fT9l> is equivalent to solving a 
system of three linear equations with (R^) 2 unknowns 0. As long as (i?(*') 2 > 3, the linear system is underdetermined 
and such can be found. Consequently, the RRP procedure, when terminated, gives us a W,*„ with Rank 2 (W;j < 3. 
As the rank of a matrix is an integer, we must have Rank(W*J = 1. It is important to note, however, that the ability of 
the RRP procedure to recover a rank-1 solution for problem (R-LBM) lies in the fact that we only have three linear terms 
of W r „ in both the objectives and the constraints. This results in solving a linear system with three equations in each 
iteration of the RRP procedure. If we have an additional linear constraint of the form Tr(BW TO ) < c for some constant 
c, the RRP procedure may produce a solution W* n with Rank 2 (W^) < 4, which does not guarantee Rank(W^) = 1, 

We have used the RRP procedure to identify the structure of problem (R-LBM) that allows for the existence of a 
rank-1 solution. However in practice this procedure is not that useful as it requires solving (R-LBM) to begin with. 
Therefore we will use our own algorithm listed in Table|I]to directly get a rank-1 solution of (R-LBM). Summarizing the 
above discussion, we propose the following algorithm, named Successive and Sequential Convex Approximation Beam 
Forming (SSCA-BF): 

1) Initialization: Let t = 0, randomly choose a set of feasible co variances W^, V to G M. 

2) Information Exchange: Choose m = M t, let each BS q ^ to compute and transfer T,(W') to BS to. 

3) Maximization: BS to use the procedure in Table Q] to obtain a solution W'+ : of problem (LBM) with the objective 
function (7 m (W m ,W*_ m ). Let W t+1 = [W*+\W*_ m ]. 

4) Continue: If |i?(W* +1 ) - J?(W t+1 - M )| < e, stop. Otherwise, set t = t+ 1, go to Step 2). 

In Step 4), e > is the stopping criteria. The above algorithm is distributed in the sense that as long as the BS to 

have the information specified in Step 2) and the channels {H m ,q}q^m, it can carry out the computation by itself. 

Theorem 1: The sequence {i?(W')} produced by the SSCA-BF algorithm is non-decreasing and converges. Moreover 

every limit point of the sequence {W} is a stationary solution to the problem (SRM). 

Proof: Fix a iteration t and let to = M 0t. Due to the fact that we are able to solve the problem (LBM) exactly, 

4 The number of unknowns for the real part of D'*' is — — — -, and the number of unknowns for the imaginary part of D'*) is — — ■ 



have that Wm is also an optimal solution to the problem (R-LBM). 
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we have [/ m (W^\ W*_ m ) > C/ m (W*„). Using O and the fact that U m (W^) = i?(W*„), we have 

R(W t+1 ) = R(W t + 1 ,Wt rn ) > U m (W t + 1 ,Wt m ) > U m (yV* m , W*_ m ) = i?(W). (23) 

Clearly the system sum rate is upper bounded, then the sequence {i?(W t )}^ 1 is nondecreasing and converges. Take 
any converging subsequence of {W'J^j, and denote it as {W Define W* = lim^oo W ( . For all BS m 6 M., 
we must have U m (W* m ,W*_ m ) > U m (W m ,W*_ m ), V W m e T m , i.e., 

e arg max U m (W m , W* J, V m e M. (24) 

Checking the KKT conditions of the above M optimization problems, it is straightforward to see that they are equivalent 
to the KKT condition of the original problem (SRM). It follows that W* is a KKT point of the problem (SRM). In 
summary, any limit point of the sequence {W'j^j is a KKT point of the problem (SRM). ■ 

IV. Multi-cell Network with Multiple Users In Each Cell 

In this section, we consider the network with multiple users per cell. In this scenario, we can no longer perform the 
SSCA-BF algorithm cyclicly among all the users to maximize the system sum rate. The reason is that different users 
in the same BS share a coupled constraint Tr(^\ gjV W m ,j) < p m . For example, consider a network with a single BS 
m and multiple users. Suppose at time 0, W" n i = 0, V i £ M m - Suppose BS m optimizes user (m, 1) first (solving 
problem (LBM) for user (m, 1) with constraints Tr(W m ,i) +Tr(^. ,j j£Af W^j) < p m and W rn .i >z 0). The covariance 
so obtained has the form 1 = p m _^ip__i^p < an( j must have the property Tr(W^ i) = p m - Then all the subsequent 
computations (t = 1, • • • ) within BS m yields W* t i = 0, V i ^ 1, because each of the problem has to satisfy the joint 
power constraint. 

In order to avoid the above problem, we propose to compute the covariance matrices BS by BS, instead of user by 

user, i.e, to update the set W m = {W m .i}i£Af m at the same time, and cycle through the BSs. To this end, we first 

identify a set of per-BS lower bounds that will be useful in the subsequent development. 
Proposition 2: For all feasible W m and a fixed W we have the following inequality 

R m (W m ,W- m ) + R-m (w) - Y.Y. Tr (w) H m , 9j (W m ,i - Wm,i)] < flCWm, W_ m ) (25) 

where the equality is achieved when W m = W m . Define the left hand side of ( 125b as C/ m (W m , W_ m ), which is the 
lower bound associated with BS m. 

Proof: We can verify, similarly as in PropositionQ] that R- m (W m , W_ m ) is jointly convex with the set of matrices 
{W mj ,}, e Af m ■ Then the lower bound in ( f25l ) can be obtained by Taylor expansion. Due to space limit, we do not reiterate 
the proof here. ■ 
Unfortunately, unlike the lower bound U m {-) obtained for the single user per BS case, U m (.) is not concave in W m , 
due to the non-concavity of 7? m (W m , W_ m ) w.r.t. W m . In the following, we propose a heuristic algorithms to optimize 
the per-BS lower bound. 

We first express the lower bound U m (W m ,'W- m ) in an equivalent form (where w TO = {"w m ,i}ieN' m ) 

Pm(w m ,W- m ) = iJm(w m ,W-m) + fi-m (w) - ^ ^ ^ T q ,j (w) ( W^Hm,^ Wm,i — W^jHm,^ Wm.ij ■ 
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Then individual BSs' lower bound optimization problem is 

max U m ( ) (26) 



Take the derivative of the Lagrangian of the problem d26j > w.r.t. w mi to be zero, we obtain 

ln(2) ^2 ^2 w_q)H m ,q 3 + ^ T m ,i(w m , w_ m )H m , m , + ^ m I p w 



,Vi£ATm (27) 



where /i m > is the dual variable associated with the power constraint, and T m j (w m ,w_ m ) is defined 

l/ln(2) 



T ra ,i (w m) W_ m ) 



W m>mi H m) ;W mj i 



(28) 



A tuple (/i,„,w.j) that satisfies the N equations in ( l27l ) as well as the complementarity and feasibility conditions 
Mm > 0, /i m (p m - Z)ieAT m w m,i w m,i) = and p m - Y,ieH m w m,i w "M > is a stationary solution to the problem 
Let us define 



M m ,i(jU m , w) = ln(2) ^2 r,j(w m ,W- m )H m , 9j +MmI P • (29) 

\(9,i)#(m,i) / 

It is shown in lfl3l Proposition 1] that the optimal beam vector w m .i that satisfy ( f27b must satisfy the following identity 

(30) 

for some constant f3 m ,i{^m) that can be computed as 



I) 



(31) 



\ (h« >m .M] 7l!i (^ m ,w)h m , mi ) 2 
As a result, we can compute {~Wm,i}ieN m by first computing f3 m ,i(fi m ) according to OTb . and then use bisection 
(similarly as in the classic water filling algorithm) to find an appropriate [i m > such that the power constraint for BS 
m is satisfied. To this end, we propose a Sequential Beamforming (S-BF) algorithm: 

1) Initialization: Let t = 0, randomly choose a set of feasible transmission beams w^, V m € M.. 

2) Information Exchange: Choose m = {(t + l)mode(M)} + 1, let each BS q ^ rn compute and transfer 
{28,j(w*)}j£jv to BS m through the backhaul network. 

3) Computation: BS m updates its beam vectors according to < T3Qb and OTb . with w = w'. Use bisection to find \i m 
that ensures the power constraint. Obtain the solution w m . 

4) Update: If C/ m (w^,w'L m ) > J7 m (w*) Set w t+1 = [w m ,w'_J; otherwise Set w t+1 = w*. 

5) Continue: If |i?(w t+1 ) - i?(w t+1 ~ M )| < e, stop. Otherwise, set t = t + 1, go to Step 2). 

Note that in Step 4) we check if the lower bound is increased. If this is indeed the case, we accept the new set of 
beams wj„. This procedure ensures i?(w t+1 ) > i?(w*). 
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The S-BF algorithm is a variant/extention of the the ICBF algorithm proposed in ff3l : Step 2) and Step 3) of S-BF 
is a sequential version of the ICBF algorithm. However, the S-BF algorithm does have several advantages/differences to 
the ICBF algorithm: i) The ICBF tries to solve the KKT system of the problem (SRM), while S-BF tries to optimize 
the per-BS lower bound for each BS; ii) In S-BF algorithm the BSs update sequentially while in the ICBF algorithm 
the BSs update at the same time. One important consequence of such difference in updating schedule is the amount 
of information exchange needed in each iteration: in our algorithm, all BSs only need to send a single copy of their 
local information to a single BS, while in ICBF algorithm, they need to send to all other BSs. As will be shown in 
Section [Vj the total information exchange needed for both S-BF and SSCA-BF algorithm is significantly less than the 
ICBF algorithm; Hi) Due to the utilization of the per-BS lower bound in Step 4), the system sum rate of the proposed 
S-BF algorithm monotonically increases and converges, while the ICBF algorithm does not possess such convergence 
guarantee; iv) In S-BF algorithm, there is no "inner iteration", in which all the BSs update their beam vectors at the same 
time to reach some intermediate convergence (note that in ICBF algorithm, the convergence of the inner iteration is not 
guaranteed). Such "inner iteration" is undesirable, because a) it is hard to decide on, in a distributed fashion, whether 
convergence has been reached and b) in each of such inner iterations, extra feedback information needs to be exchanged 
between the BSs and their users. 

V. Numerical Results 

In this section, we give numerical results demonstrating the performance of the proposed algorithms. We mainly 
consider a network with a set W of BS, where |W| = 14 (see Fig. Q] for the system topology of the network with 
randomly generated user locations). 4 of the BSs are coordinated for transmission (in the set VW), i.e., M = 4. All 
other BSs' (in the set W/A4) transmission is regarded as noise. The BS to BS distance is 2 km. Let d qyTni be the 
distance between BS q and ith user in mth cell. The channel coefficients are modeled as zero mean circularly symmetric 
complex Gaussian vector with (200/d giTOi ) 3 ' 5 L q _ mi as variance for each part, where 10 log 10(L q , mi ) is a real Gaussian 
random variable modeling the shadowing effect with zero mean and standard deviation 8. The environmental noise 
power is modeled as the power of thermal noise plus the power of noises/interferences generated by non-coordinating 
BSs: c TOj j = er 2 + J2 w ew-M ' 5 L w ^ ni p w . We take p m = 1 for all m £ W, and define the SNR as 

101ogl0(p m /er 2 ). The stopping criteria is set to be e = 10 2 for all the algorithms. 

In Fig. |2] and Fig. [3] we consider networks with N = K = 5 and N = K = 10, where the the users i £ Mm that 
are associated with BS m are uniformly placed within d m . mi £ [200, 1000] meters. We show the sum rate performance 
of the S-BF algorithm comparing with the ICBF algorithm in lfl~3l and the non-coordinating schemes where the BSs 
individually perform zero forcing beamforming and channel matched filter beamforming. In Fig. |4]we consider network 
with N = K = 5 and d mymi £ [200, 300], V m,i. Clearly all the coordinated schemes achieve similar throughput 
performance, which is significantly higher than the non-coordinated schemes. 

We then compare the amount of inter-cell information needed for different coordinated schemes. We define the unit of 
information transfer as the total information needed from the set of coordinated BS for updating the beam vectors for a 
single BS m £ Ai. Clearly, in each iteration of the S-BF algorithm, a single unit of information is needed to go through 
the backhaul network, while in ICBF algorithm, M units of information are needed. In Fig. and Fig. [6] we demonstrate 
the averaged number of iterations and the averaged total units of information needed for different coordinated schemes 



November 29, 2011 



DRAFT 



II 



400C 



Coordinated BS 
-e- Mobile Users 
-0-Non-Coordinated BS 



000 3000 4000 5000 6000 

[m] 



7000 8000 9000 




--- ICBF 
-0-Zero Forcing 
-^Channel Match 
□ S-BF 



> — > — & — e> — e> — e> — > — 1> 



4> — — — o — — — — <> 



30 40 50 

SNR (<JB) 



60 



70 



Fig. 1. Topology of simulated network. 
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until convergence. We observe that the total units of information needed for the proposed SSCA-BF and S-BF algorithms 
are around 25% less than the ICBF algorithm when M = 4, and around 40% less when M = 9. |f| We also emphasize 
that typically, several inner iterations are needed per outer iteration of ICBF, and we have not count the extra information 
needed between the BSs and the users in these inner iterations. As a results, in Fig. and Fig. [6] we see that the total 
iterations needed for ICBF algorithm are close to the S-BF algorithm. In all the simulations presented above, the results 
are obtained by averaging over 500 randomly generated user locations and channel realizations. 

VI. Conclusion 

In this correspondence, we study the sum rate maximization problem using beamforming in a multi-cell MISO network. 
We have explored the structure of the problem and identified a set of lower bounds for the system sum rate. For the case 
of a single user per cell, we proposed an algorithm that reaches the KKT point of the sum rate maximization problem. 

5 The network with M = 9 is generated similarly as the case of M = 4, i.e., the center 9 BSs are coordinating, while the other BSs around them 
are non-coordinating and their transmissions are considered as noises. 
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Fig. 5. Comparison of the Number of Iterations/Information Units Needed Fig. 6. Comparison of the Number of Iterations/Information Units Needed 
for Convergence. K = 5, N = 5, M = 4. for Convergence. K = 5, N = 5, M = 9. 

For the case of multiple users per cell, we propose and algorithm that achieve high system throughput with reduced 
backhaul information exchange among the BSs. 
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